Backport to 2.17.x: #7566: Improve transaction check in refresh_cagg #7593
+301
−11
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This is an automated backport of #7566: Improve transaction check in refresh_cagg.
The original issue is #6533.
This PR will be merged automatically after all the relevant CI checks pass. If this fix should not be backported, or will be backported manually, just close this PR. You can use the backport branch to add your changes, it won't be modified automatically anymore.
For more details, please see the documentation
Original description
Improve transaction check in refresh_cagg
Intro: Hi, I was investigating an issue with
portal snapshots (0) did not account for all active snapshots (1)
error inside another Postgres extension (wdb-97, unrelated to Timescale) and stumbled upon the #6533 issue in Timescale that was reporting the same error. Decided to have a deeper look into it.This PR allows better error messages to be reported from
refresh_continuous_aggregate
when it is called from an atomic (no transaction allowed) context. One of the following messages:ERROR: refresh_continuous_aggregate() cannot run inside a transaction block
ERROR: refresh_continuous_aggregate() cannot be executed from a function
is reported now instead of:
ERROR: portal snapshots (N) did not account for all active snapshots (N+1)
. There are no other changes torefresh_continuous_aggregate
logic.Longer description, also included in
process_utility.h
:Procedures that use multiple transactions cannot be run in a transaction block (from a function, from dynamic SQL) or in a subtransaction (from a procedure block with an
EXCEPTION
clause). Such procedures use PreventInTransactionBlock function to check whether they can be run.Though currently such checks are incomplete, because
PreventInTransactionBlock
requiresisTopLevel
argument to throw a consistent error when the call originates from a function. ThisisTopLevel
flag (that is a bit poorly named - see below) is not readily available inside C procedures. The source of truth for it -ProcessUtilityContext
parameter is passed to ProcessUtility hooks, but is not included with the function calls. There is an undocumented SPI_inside_nonatomic_context function, that would have been sufficient forisTopLevel
flag, but it currently returns false when SPI connection is absent (that is a valid scenario when C procedures are called from top-level SQL instead of PLPG procedures orDO
blocks) so it cannot be used.To work around this the value of
ProcessUtilityContext
parameter is saved when TS ProcessUtility hook is entered and can be accessed from C procedures using newts_process_utility_is_context_nonatomic
function. The result is called "non-atomic" instead of "top-level" because the way how isTopLevel flag is determined from the ProcessUtilityContext value in standard_ProcessUtility is insufficient for C procedures - it excludesPROCESS_UTILITY_QUERY_NONATOMIC
value (used when called from PLPG procedure without anEXCEPTION
clause) that is a valid use case for C procedures with transactions. See details in the description of ExecuteCallStmt function.It is expected that calls to C procedures are done with
CALL
and always pass though theProcessUtility
hook. TheProcessUtilityContext
parameter is set toPROCESS_UTILITY_TOPLEVEL
value by default. In unlikely case when a C procedure is called without passing throughProcessUtility
hook and the call is done in atomic context, thenPreventInTransactionBlock
checks will pass, but SPI_commit will fail when checking that all current active snapshots are portal-owned snapshots (the same behaviour that was observed before this change). In atomic context there will be an additional snapshot set in_SPI_execute_plan
, see the snapshot handling invariants description in that function.With initial version of this PR, in TS
ProcessUtility
hook the savedProcessUtilityContext
value is reset back toPROCESS_UTILITY_TOPLEVEL
on normal exit but is NOT reset in case ofereport
exit. C procedures can callts_process_utility_context_reset
function to reset the saved value before doing the checks that can result inereport
exit. The scenario when more thorough reset may be necessary - when subsequent calls after the failed atomic call are not passed through theProcessUtility
hook - seems to be unlikely.Closes
#6533.Disable-check: commit-count